Detection and removal of barcode swapping in single-cell RNA-seq data

نویسندگان

  • Jonathan A. Griffiths
  • Aaron T.L. Lun
  • Arianne C. Richard
  • Karsten Bach
  • John C Marioni
چکیده

Multiplexing is a widely-used procedure that allows multiple DNA libraries to be pooled together for efficient sequencing. However, recent reports suggest that the DNA barcodes that label different libraries can “swap” on patterned flow-cell Illumina sequencing machines, including the HiSeq 4000, HiSeq X, and NovaSeq, thereby mislabelling molecules [1, 2]. This may compromise many types of -omic assays, but it is particularly problematic for single-cell RNA-seq (scRNA-seq), where many libraries are multiplexed together. A number of widely used plate-based scRNA-seq library preparation methods isolate and process individual cells in wells of a microwell plate, before performing library preparation in parallel [3]. A unique combination of sample barcodes labels the library of each cell, typically with one barcode at each end of a cDNA molecule. One barcode provides a row index for each cell on the microwell plate and the other barcode provides a column index. Barcode swapping therefore moves transcripts between cells. We generated a dataset (see Supplementary Files, “Richard data”) where two plates of single-cell libraries were multiplexed for sequencing on the HiSeq 4000 using two mutually exclusive barcode sets. We expect to only observe reads

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data

Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...

متن کامل

A Graph-Based Clustering Approach to Identify Cell Populations in Single-Cell RNA Sequencing Data

Introduction: The emergence of single-cell RNA-sequencing (scRNA-seq) technology has provided new information about the structure of cells, and provided data with very high resolution of the expression of different genes for each cell at a single time. One of the main uses of scRNA-seq is data clustering based on expressed genes, which sometimes leads to the detection of rare cell populations. ...

متن کامل

I-13: Transcriptome Dynamics of Human and Mouse Preimplantation Embryos Revealed by Single Cell RNA-Sequencing

Background: Mammalian preimplantation development is a complex process involving dramatic changes in the transcriptional architecture. However, it is still unclear about the crucial transcriptional network and key hub genes that regulate the proceeding of preimplantation embryos. Materials and Methods: Through single-cell RNAsequencing (RNA-seq) of both human and mouse preimplantation embryos, ...

متن کامل

scPipe: a flexible data preprocessing pipeline for single-cell RNA-sequencing data

Single-cell RNA sequencing (scRNA-seq) technology allows researchers to profile the transcriptomes of thousands of cells simultaneously. Protocols that incorporate both designed and random barcodes to label individual cells and molecules have greatly increased the throughput of scRNA-seq, but give rise to a more complex data structure. There is a need for new tools that can handle the various b...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017